Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification

نویسندگان

چکیده

RGB-infrared person re-identification is an emerging cross-modality task, which very challenging due to significant modality discrepancy between RGB and infrared images. In this work, we propose a novel modality-adaptive mixup invariant decomposition (MID) approach for towards learning modality-invariant discriminative representations. MID designs scheme generate suitable mixed images mitigating the inherent at pixel-level. It formulates procedure as Markov decision process, where actor-critic agent learns dynamical local linear interpolation policy different regions of under deep reinforcement framework. Such guarantees modality-invariance in more continuous latent space avoids manifold intrusion by corrupted samples. Moreover, further counter enforce visual semantics feature-level, employs convolution disassemble regular layer into modality-specific basis layers modality-shared coefficient layer. Extensive experimental results on two benchmarks demonstrate superior performance over state-of-the-art methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material for “RGB-Infrared Cross-Modality Person Re-Identification”

This supplementary material accompanies the paper “RGB-Infrared Cross-Modality Person Re-Identification”. It includes more details of Section 4, as well as extra evaluations of our proposed deep zero-padding method. 1. Details of Counting Domain-Specific Nodes In the third paragraph of Section 4.2 in the main manuscript, we quantify the number of domain-specific nodes in the trained network in ...

متن کامل

Query Based Adaptive Re-ranking for Person Re-identification

Existing algorithms for person re-identification hardly model query variations across non-overlapping cameras. In this paper, we propose a query based adaptive re-ranking method to address this important issue. In our work, negative image pairs can be easily generated for each query under non-overlapping cameras. To infer query variations across cameras, nearest neighbors of the query positive ...

متن کامل

Pose Invariant Embedding for Deep Person Re-identification

Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With bad alignment, the background noise will significantly compromise the feature learning and and matching process. To address this problem, this paper introduces the pose invariant embedding (PIE) as a pedestrian descriptor. First,...

متن کامل

Hierarchical Invariant Feature Learning with Marginalization for Person Re-Identification

This paper addresses the problem of matching pedestrians across multiple camera views, known as person re-identification. Variations in lighting conditions, environment and pose changes across camera views make re-identification a challenging problem. Previous methods address these challenges by designing specific features or by learning a distance function. We propose a hierarchical feature le...

متن کامل

View-Adaptive Metric Learning for Multi-view Person Re-identification

Person re-identification is a challenging problem due to drastic variations in viewpoint, illumination and pose. Most previous works on metric learning learn a global distance metric to handle those variations. Different from them, we propose a view-adaptive metric learning (VAML) method, which adopts different metrics adaptively for different image pairs under varying views. Specifically, give...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i1.19987